Comparing Feature Matching for Visual Object Categorization: MAX vs. Bag-of-Words

نویسندگان

  • Rob Wijnhoven
  • Peter H. N. de With
چکیده

In this paper we address the comparison of two feature matching techniques which can be integrated in the HMAX framework. This comparison involves the originally proposed MAX technique and the histogram technique originating from Bag-of-Words literature. We have found that each of these techniques have their own field of operation. The histogram technique clearly outperforms the MAX technique with 5–15% for small dictionaries up to 500–1,000 features. A second investigation concentrates on comparing the often used hard vector quantization technique and a soft matching score technique for the histogram creation. It was found that the difference in performance is not significant and the scores are often within their standard deviations. Aiming at an embedded implementation such as in a surveillance system, computation power and memory (number of dictionary features) are intrinsically limited, so that the histogram technique is favored over the MAX technique.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparing Feature Matching for Object Categorization in Video Surveillance

In this paper we consider an object categorization system using local HMAX features. Two feature matching techniques are compared: the MAX technique, originally proposed in the HMAX framework, and the histogram technique originating from Bag-of-Words literature. We have found that each of these techniques have their own field of operation. The histogram technique clearly outperforms the MAX tec...

متن کامل

Contextual-Guided Bag-of-Visual-Words Model for Multi-class Object Categorization

Bag-of-words model (BOW) is inspired by the text classification problem, where a document is represented by an unsorted set of contained words. Analogously, in the object categorization problem, an image is represented by an unsorted set of discrete visual words (BOVW). In these models, relations among visual words are performed after dictionary construction. However, close object regions can h...

متن کامل

Concept-Specific Visual Vocabulary Construction for Object Categorization

Recently, the bag-of-words (BOW) based image representation is getting popular in object categorization. However, there is no available visual vocabulary and it has to be learned. As to traditional learning methods, the vocabulary is constructed by exploring only one type of feature or simply concatenating all kinds of visual features into a long vector. Such constructions neglect distinct role...

متن کامل

Recognizing in the depth: Selective 3D Spatial Pyramid Matching Kernel for object and scene categorization

This paper proposes a novel approach to recognize object and scene categories in depth images. We introduce a Bag of Words (BoW) representation in 3D, the Selective 3D Spatial Pyramid Matching Kernel (3DSPMK). It starts quantizing 3D local descriptors, computed from point clouds, to build a vocabulary of 3D visual words. This codebook is used to build the 3DSPMK, which starts partitioning a wor...

متن کامل

Understanding bag-of-words model: a statistical framework

The bag-of-words model is one of the most popular representation methods for object categorization. The key idea is to quantize each extracted key point into one of visual words, and then represent each image by a histogram of the visual words. For this purpose, a clustering algorithm (e.g., K-means), is generally used for generating the visual words. Although a number of studies have shown enc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009